A Computational Approach to Qualitative Analysis in Large Textual Datasets
نویسنده
چکیده
In this paper I introduce computational techniques to extend qualitative analysis into the study of large textual datasets. I demonstrate these techniques by using probabilistic topic modeling to analyze a broad sample of 14,952 documents published in major American newspapers from 1980 through 2012. I show how computational data mining techniques can identify and evaluate the significance of qualitatively distinct subjects of discussion across a wide range of public discourse. I also show how examining large textual datasets with computational methods can overcome methodological limitations of conventional qualitative methods, such as how to measure the impact of particular cases on broader discourse, how to validate substantive inferences from small samples of textual data, and how to determine if identified cases are part of a consistent temporal pattern.
منابع مشابه
Searching Large Textual Dataset With Limited Computational Resources
In this paper we propose a search approach that can process large volumes of textual data efficiently and effectively even in environments where computational resources are limited. The traditional search solution for large collections assumes availability of practically unlimited computational resources. For many applications and organization this assumption is not realistic. Empirical evaluat...
متن کاملMammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease
Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...
متن کاملEvaluation of “Mosaic 1 Reading”: A Microstructural Approach to Textual Analysis of Pedagogical Materials
To analyze and evaluate textbooks, researchers have either proposed scales and checklists to be filled by teachers and learners or conducted qualitative investigations of the match between SLA theories and textbook activities. This study, however, employs the microstructural approach of schema theory to scrutinize the reading passages of “Mosaic 1 Reading”. To this end, 17 passages of the textb...
متن کاملA Neural Network Model to Solve DEA Problems
The paper deals with Data Envelopment Analysis (DEA) and Artificial Neural Network (ANN). We believe that solving for the DEA efficiency measure, simultaneously with neural network model, provides a promising rich approach to optimal solution. In this paper, a new neural network model is used to estimate the inefficiency of DMUs in large datasets.
متن کاملThe Study of Ideological Manipulation in Persian Translations of Noam Chomsky’s Media Control Based on Farahzad’s Translation Criticism Model
Abstract Critical Discourse Analysis as an interdisciplinary approach aims at making transparent the connections between discourse practices and social practices and provides ways of looking into translations from a critical standpoint.Farahzad is among the scholars who presented her specific CDA model inspired by Fairclough’s approach. The present Critical Discourse Analysis (CDA)-based s...
متن کامل